[ROCm][CI] Add large_gpu_mark to test_max_tokens_none for ROCm#37717

Merged
DarkLight1337 merged 1 commit into vllm-project:main from ROCm:akaratza_fix_regression_test
Mar 22, 2026
Conversation

@AndreasKaratzas (Collaborator)

Follow-up for:

Marks the max_tokens test with distilbert/distilgpt2 as a large-GPU test. Addresses a failure in the mi250_1: Regression step.

Motivation: https://buildkite.com/vllm/amd-ci/builds/6721/steps/canvas?sid=019d09d4-708e-44b0-a0d0-ccf0e3c00a94&tab=output

cc @kenroche

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
@AndreasKaratzas AndreasKaratzas added ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm labels Mar 20, 2026
@github-project-automation github-project-automation bot moved this to Todo in AMD Mar 20, 2026
@AndreasKaratzas (Collaborator, Author)

Testing MI250 to see if issue is resolved (added rocm and ready labels).

@gemini-code-assist bot (Contributor) left a comment:

Code Review

The pull request addresses the CI failure on ROCm platforms by applying the large_gpu_mark to the test_max_tokens_none test. This ensures the test runs only in environments with sufficient GPU memory, improving the reliability of the test suite. The implementation is clean and correctly uses pytest.param marks within pytest.mark.parametrize for platform-conditional marking.


@AndreasKaratzas AndreasKaratzas marked this pull request as ready for review March 21, 2026 00:57
Comment thread on tests/test_regression.py:

```python
    "model",
    [
        pytest.param(
            "distilbert/distilgpt2",
```
Member:
This model only has 88.2M params, how does it OOM?

Member:
Can you add it to #37736?

@DarkLight1337 DarkLight1337 merged commit c86b17c into vllm-project:main Mar 22, 2026
16 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in AMD Mar 22, 2026
@AndreasKaratzas AndreasKaratzas deleted the akaratza_fix_regression_test branch March 22, 2026 04:29
RhizoNymph pushed a commit to RhizoNymph/vllm that referenced this pull request Mar 26, 2026
SouthWest7 pushed a commit to SouthWest7/vllm that referenced this pull request Mar 27, 2026
khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026
nithinvc pushed a commit to nithinvc/vllm that referenced this pull request Mar 27, 2026
JiantaoXu pushed a commit to JiantaoXu/vllm that referenced this pull request Mar 28, 2026
mtparet pushed a commit to blackfuel-ai/vllm that referenced this pull request Apr 9, 2026
Labels

ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm

Projects

Status: Done

2 participants